Time- and memory-efficient genome assembly with Raven

نویسندگان

چکیده

Whole genome sequencing technologies are unable to invariably read DNA molecules intact, a shortcoming that assemblers try resolve by stitching the obtained fragments back together. Here, we present methods for improvement of de novo assembly from erroneous long reads incorporated into tool called Raven. Raven maintains similar performance various genomes and has accuracy on par with other support third-generation data. It is one fastest options while having lowest memory consumption majority benchmarked datasets. designed democratize assembly, being simple efficient keeping high accuracy. Using method detection false overlaps based graph drawing, it can be employed sizes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel and Memory-Efficient Reads Indexing for Genome Assembly

As genomes, transcriptomes and meta-genomes are being sequenced at a faster pace than ever, there is a pressing need for e cient genome assembly methods. Two practical issues in assembly are heavy memory usage and long execution time during the read indexing phase. In this article, a parallel and memory-e cient method is proposed for reads indexing prior to assembly. Speci cally, a hash-based s...

متن کامل

SparseAssembler2: Sparse k-mer Graph for Memory Efficient Genome Assembly

Motivation: To tackle the problem of huge memory usage associated with de Bruijn graph-based algorithms, upon which some of the most widely used de novo genome assemblers have been built, we released SparseAssembler1. SparseAssembler1 can save as much as 90% memory consumption in comparison with the state-of-art assemblers, but it requires rounds of denoising to accurately assemble genomes. Alg...

متن کامل

Efficient Synergistic Single-Cell Genome Assembly

As the vast majority of all microbes are unculturable, single-cell sequencing has become a significant method to gain insight into microbial physiology. Single-cell sequencing methods, currently powered by multiple displacement genome amplification (MDA), have passed important milestones such as finishing and closing the genome of a prokaryote. However, the quality and reliability of genome ass...

متن کامل

RAVEN: Real-Time Analyzing and Verification Environment

In this paper we present the real-time verification and analysis tool RAVEN. RAVEN is developed for verifying timed systems on various levels of abstraction. It integrates a real-time model checker for real-time specifications, it offers algorithms for analyzing critical delay times, for inspecting data values and event occurrences and for detecting dead-locks and live-locks. The counter exampl...

متن کامل

Memory-Efficient Backpropagation Through Time

We propose a novel approach to reduce memory consumption of the backpropagation through time (BPTT) algorithm when training recurrent neural networks (RNNs). Our approach uses dynamic programming to balance a trade-off between caching of intermediate results and recomputation. The algorithm is capable of tightly fitting within almost any user-set memory budget while finding an optimal execution...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Nature Computational Science

سال: 2021

ISSN: ['2662-8457']

DOI: https://doi.org/10.1038/s43588-021-00073-4